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Electrocardiogram (ECG) is the most common method for monitoring the 
working of the heart. ECG signal is the basis to determine normal or 
abnormal rhythm, thereby helping to accurately diagnose cardiovascular 
diseases. Therefore, an automatic algorithm to detect and diagnose abnormal 
heart rhythms is essential. There are many methods of classifying 


arrhythmias using machine learning algorithms such as k-nearest neighbors 
(KNN), support vector machines (SVM), based on the features extracted 
from the record of ECG signal. Actually, deep learning algorithms are 
evolving and highly effective in image analysis and processing. In this 
research, a dense neural network model is proposed to classify normal and 
abnormal beats. Input ECG signal presenting a time series is converted into 
2-D spectral image by applying wavelet transform. Our research is evaluated 
based on using the Massachusetts Institute of Technology-Beth Israel 
Hospital (MIT-BIH) arrhythmia database. The accuracy of the classification 
algorithm we employ is 99.8%, demonstrating the model's validity when 
compared to other reports' findings. This is the foundation for our algorithm 
to prove it can be utilized as an efficient model for categorizing arrhythmia 
using ECG signals. 
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1. INTRODUCTION 

An arrhythmia [1] is an electrical irregularity of the heart, which can be a pacing or electrical 
conduction anomaly in the heart chambers, in which the heartbeat is irregular, too fast or too slow. An 
arrhythmia can be asymptomatic or cause symptoms such as palpitations, a sense that the heart is beating too 
quickly or irregularly, or a break between heartbeats [2]. Many cases of severe arrhythmias cause the patient 
to become dizzy, faint, have trouble breathing, and have chest pain. Complications can occur such as stroke, 
heart failure, or sudden death. According to WHO [3], cardiovascular diseases are the cause of the largest 
mortality in the world (more than 30%), higher than death from cancer. It is estimated that each year about 
17.9 million people worldwide die from cardiovascular diseases of which 85% are from heart attack and 
stroke. Especially in the current situation of COVID-19 epidemic, the risk of death often focuses mainly on 
the elderly or patients with underlying medical conditions including cardiovascular disease. 

Electrocardiogram (ECG) is a chart that records the electrical impulses generated by cardiac muscle 
cell through electrodes placed in the body. The ECG signals are displayed in a 1-D time series that helps 
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track and detect irregularities in the heart rhythm based on the waveform and the frequency of the heartbeat. 
Electrocardiograms can be used to diagnose cardiovascular problems in individuals. Electrocardiogram 
reading is a difficult task that needs experience and training. The specialist can evaluate if a clinical symptom 
of a heart problem is present based on the recorded data. As a result, identifying cardiac arrhythmias is 
mostly dependent on the knowledge of the doctor, and various doctors will provide different outcomes. 
Furthermore, with a lengthy time interval ECG record, young medical practitioners may overlook mild 
signals of cardiovascular illness. As a result, we require a tool to assist clinicians in the analysis of ECGs. As 
a result, we require a tool to assist clinicians in the analysis of ECGs. In which one of the key factors for 
properly diagnosing heart-related illnesses is the categorization of abnormal heart beats. 

The ECG signal is a 1-D time series that can be processed and analyzed automatically by machine 
learning algorithms. Furthermore, deep learning algorithms have recently been demonstrated to be extremely 
efficient in the processing and categorization of 2D images. Deep learning algorithms, which are a subset of 
machine learning, rely on data to understand how to solve problems. Deep learning employs the neural 
network, a multi-layered structure of algorithms. Artificial neural networks offer unique characteristics that 
allow deep learning models to accomplish tasks that machine learning models have limitations. 

There have been several studies in the subject of automated categorization arrhythmias. Wave 
morphological characteristics [4]-[6], as well as parameters such as variance and standard deviation [7] [8], 
have been extracted from 1-D ECG signals in the past, with the use of machine learning techniques such as 
KNN, support vector machine, decision tree, and random forest [9]-[11]. In order to extract the features or 
normalize data [7] most correctly, these techniques require a signal preprocessing step to filter noise, filter 
baseline drift [1]-[9]. Deep learning approaches are increasingly being used in image processing and analysis 
with great efficiency and accuracy [12]-[16]. The neural network model may operate effectively with 
multidimensional inputs without the feature extraction step. However, because the output of a 1-D input 
signal is less reliable than a 2-dimensional input, deep learning models often use a 2D picture as their input. 
A previous study [17] utilized a picture of the ECG signal that had not been transformed, which obtained 
99.21% accuracy rate. The input ECG (1-D) time series signal may be converted into a 2-D spectral picture 
using transformation techniques. Some recent research used a transformed 2D spectral image as the input of 
neural network classification model for 3 classes classification [15] with the accuracy of 98.7%, and 8 classes 
classification [16] achieving an accuracy of 99.11%. 

Our contribution focuses primarily on approaches for extracting characteristics from an ECG signal 
and then doing classification using standard machine learning models, which yielded encouraging results. In 
addition to obtaining features in the time domain [4]-[6], some approaches employ transformation algorithms 
such as the Fourier transform and the wavelet transform to extract more characteristics of the signal in the 
frequency domain [7]-[9]. However, if linear features are present, performing feature extraction is extremely 
difficult and might reduce the classification model's effectiveness. Furthermore, if the database size is huge, 
standard machine learning methods will not attain the optimum efficiency. Heartbeat classification 
approaches based on deep learning algorithms have recently been presented as a solution to this challenge. 
The input processing of the neural network is also taken into consideration, in addition to the usage of multi- 
layer neural network models with superior image classification efficiency. Only information about the 
waveforms is obtained when the classification model's input is an image of a 1D ECG signal [17], and this 
information is locally represented on the image, which means that, aside from the morphology of the signal, 
the remaining intervals on the image contain no information. As a result, several approaches have 
transformed a 1D ECG signal into a 2D spectral picture using transformation algorithms [15], [16]. The 
signal's temporal and frequency domain information are both contained in 2D spectral images. Clipping the 
signal segments at specific intervals from the beginning to the conclusion of the signal, on the other hand, 
might produce unevenness in the classification of the beats in the 2D pictures. We suggest a new approach in 
this study that is based on the evolution of earlier methods, which are: 

a) Equally cut the signal segments by taking the same interval on both sides of the R peaks. 

b) Using the continuous wavelet transform (CWT) to convert the signal segment information into 2D 
spectral pictures from the clipped signal segments. 

c) The dense neural network model is utilized to identify heartbeats using these images as input. 

d) Our research paper is organized. Part II explains our research method. Part III presents experimental 
results, and Part IV concludes the paper. 


2. RESEARCH METHOD 
2.1. Databases 

In this study, we evaluate the effectiveness of our algorithm based on using the MIT-BIH 
arrhythmia database [1] which is published on Physionet.org. The database includes 48 ECG records, each is 
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slightly more than 30 minutes long. Inside the database, each record of each different patient is bandwidth 
filtered at the frequency range of 0.1-100 Hz and digitized at a frequency of 360 Hz. The records were 
labeled with the R peaks and the position of the peaks that appeared to be an arrhythmia. Therefore, the 
effectiveness of our classification model can be assessed. The three components of an ECG are depicted in 
Figure 1. The P wave represents atria depolarization; the QRS complex, which represents ventricular 
depolarization; and the T wave, which shows ventricle repolarization. 
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Figure 1. ECG of a heart in normal sinus rhythm 


2.2. Block diagram 

Figure 2 shows the implementation of the algorithm. The ECG signal is classified into two classes: 
normal and abnormal. Firstly, we use the package waveform-database (WFDB) for loading ECG and 
annotations from the dataset in Keras framework in Python. Each type is identified by a symbol that 
corresponds to the number of beats. The signal's peak R may be determined using the peak's characteristics. 
However, we utilize R peak value labeled in the dataset for signal processing simplicity. The second stage is 
signal segmentation, which involves taking an equal time-series signal before and after R peaks. Wavelet 
transformation is used to convert these segmented ECG signal intervals from time series to 2-dimensional 
spectral pictures. In this step, we use the scalogram tool to represent the transformed images. This image 
dataset is then used as the input of the classification deep neural network model. The image is divided into 
training sets and validation sets to perform classification and evaluate our model. 
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Figure 2. Arrhythmia detection diagram 
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2.3. Proposed method 
2.3.1. Pre-processing 

To convert the signal ECG into images, we first need to output the R peak in the signal records, 
which is used to represent a heartbeat. We detect R peaks by signal's detecting peaks algorithm from the 
Scipy package based on properties of ECG signal's peak. The algorithm performs finding all local maxima in 
the data series by simply comparing neighboring values. Then select R peaks as a subset of these peaks based 
on the conditions of the peak's properties. The ECG signal in the data set has a sampling frequency of 360 
Hz, so the signal will be represented in the time domain by time index unit with each time index equal to 
1/360 s. In our case, we detect R peaks by choosing the maximal value of the ECG signal in the minimum 
horizontal distance of 150 indexes between neighboring peaks. Figure 3 presents the R peaks of the ECG 
signal. 
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Figure 3. R peaks detected by the scipy package 


In this study, for simplicity, we utilize R peaks value, which has already been positioned in the 
database. From the data set, 9000 heartbeats are randomized with equal numbers of normal and irregular 
beats, for a total of 4500 beats. From the position of R peak of each beat, it will go backward and forward to 
each side a signal interval of the length equal 200 indexes. 


2.3.2. Generation of 2-D spectral images 

Theoretically, any signal can be decomposed into its component signals in both the temporal and the 
frequency domain. Therefore, the ECG signal can be analyzed into component signals to determine when and 
at what frequency the arrhythmia occurs [18]. The wavelet transform fulfills these two requirements. It 
makes the continuous signal x(t) from one dimension in two a 2D space defined as (1), 


Sab) = fis x(Qejat (1) 


where a and b are the scale factor and shift translation applied in the continuous parent wavelet (t). In this 
step, the continuous wavelet transform (CWT) is applied to generate an ECG signal into a 2D spectrum. 
Depending on the study goal, several types of wavelet transforms can be employed to analyze ECG signals. 
For example, to remove ECG baseline, we use five wavelet transform families with a total of 14 wavelet 
configurations: Daubechies, Coiflets, Symlets, Fejer-Korovkin, and Meyer [19]. The variation of the 
abnormal heartbeat is a non-stationary signal so it is suitable to choose Morlet as the mother wavelet because 
of its analysis application on discriminate arrhythmias in the ECG signal [20]. 

In theory, the Morlet wavelet is the most popular complex wavelet used in practice [21] and is 
defined as (2), 


1 ; w 
W(t) = gg E —e 2 ez (2) 


where wọ is the central frequency of the mother wavelet and set by default for each wavelet with respective 
value. The second term in the bracket is correct for the non-zero mean of the complex sinusoid of the first 
term and can be negligible if wg>S. 

With regards to the scale factor, the size of the picture's height has an impact on the resolution of 2D 
spectral pictures. On the other side, the signal length corresponds to the width size of spectral images. We 
chose the features condition that offer the output pictures size with the best classification accuracy by running 
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the experiment multiple times with varied sizes of 2D output images and comparing the classification results 
in part 3. 

The scale factor is selected to transform linearity from 1 to 150. The scale represents the number of 
times the wavelet is stretched. The larger scale, the more stretched wavelet is, and the more sensitive it is to 
lower signal frequencies. For better visualization, the scalogram is used to generate and show the 2D 
spectrum for the CWT. The CWT coefficients of a signal are taken in absolute value and its graph is plotted. 
Figure 4 presents an ECG signal and its scalogram. 
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Figure 4. An ECG signal and its scalogram 
In the scalogram output, the period in the vertical is defined by (3), 
Period = = (3) 


where s is the scale, and b (wọ) is the central frequency used to build the chosen wavelet. Each horizontal 
feature may be regarded as a frequency of the total signal, and there is no continuous line in the output image 
to indicate that the frequencies are not time-consistent. The scalogram is a two-dimensional picture of 
150x401 pixels, with 150 representing the number of scales used in the wavelet transform and 401 being the 
number of indices in the ECG signal data. 


2.3.3. Deep neural network construction 

Data setup: Our prepared dataset now is divided into the training set and test set with a 3/1 ratio; the 
train/test splits are generated to ensure that there is no overlap between the two sets. 

Model construction: After the ECG signal is converted into a spectral image, each image will be 
assigned a value of 0 (normal peak) or 1 (abnormal peak). To classify these images, we use a basic 
convolutional neural network. The size of the model input is a x 150 x 401 where a is the number of 150x401 
pixel images entered into the model. The hidden layer is constructed by two dense layers with 500 and 100 
nodes separately. The output layer has two neurons for the final classifier that are either 0 (normal peak) or 1 
(abnormal peak). 

Activation function: The activation function may be used to compute the output of each node in an 
artificial neural network given a collection of image inputs. The rectified linear activation function (ReLU) is 
used in the hidden layer and is suggested as the default for multilayer perceptron (MLP) and convolutional 
neural networks (CNNs) [22]. If the input is positive, it will be directly output; otherwise, it will be zero. We 
use the Softmax activation function in the output layer to generate a vector of classification probabilities, 
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with the probabilities of each value proportional to the relative scale of each value (0 and 1) in the vector 
using (4), 
E (4) 

o(z)i = woe 
where K is the number of classes. 

By applying the standard exponential function to each element z; and normalizes these values by 
dividing by the sum of all these exponentials, it ensures that each component will be in the interval [0,1] and 
the sum of components output vector z is 1. After that, the class with higher probabilities value will be 
labeled as the image’s type. Figure 5 presents the dense neural network (DNN) model architecture. 
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Figure 5. The dense neural network (DNN) model architecture 


Cost function: The goal of the cost function is to compromise the accuracy of the algorithm, by 
taking the average error between the prediction result and the performance result. In theory, there are a 
variety of cost functions that can be used. In our paper, we choose sparse categorical cross-entropy as cost 
function because it saves memory and computation time. Instead of using an entire vector, it just utilizes a 
single integer. The cross-entropy loss between the labels and our results is calculated with the (5), 


C= FEM (le #ln In (ae) +(1- ye) In In (1 — ac) J) 6) 


where C is the cost to be minimized, n is the number of training points, y is the target value, N is the number 
of the classes, c is the index of the class, and a is the actual value. We use the stochastic gradient descent 
(SGD) optimizer for training our model. It evaluates the error gradient for the current state of the model using 
the training dataset, then updates the weights of our model via backpropagation. 


3. RESULTS AND DISCUSSION 
3.1. Classification result 

The two parameters in our method that affect the result directly are scale of wavelet transform and 
the interval of signal for each spectral image. The large scale can offer the model high sensitivity, but it takes 
a long time to classify the data and the speed of the process is very slow. The signal segment intervals are 
similar. Long interval carries more information of ECG signal, but it also takes more time and decreases the 
speech of the process. With the capacity of our setup system, we have to tradeoff between the scales and the 
interval values. 

Table 1 represents the parameters and the corresponding testing time. We have the best accuracy 
with a CWT scale of 150 and an ECG signal segment interval of 401 indices. The high sensitivity of the 
transformation model is shown by the scale value of 150. The signal interval from R peaks is 401 indexes 
before and after that there is adequate information in the present period and comparing the current period to 
the before and after periods. 

The accuracy of the model is 99.8% after running 10 epochs using an optimizer of stochastic 
gradient descent and computing the loss with a sparse categorical cross-entropy. In the last step, we use the 
confusion matrix to evaluate our model. The confusion matrix allows us to evaluate the classification model 
visually. Each row represents the actual or true value, and each column represents the predicted value; we 
next compare the actual and predicted values for each class. The diagonal values are the biggest in an 
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efficient model, equivalent to the number of predicted values equal to the actual value. The values in the 
matrix are then normalized to a range of 0 to 1, with 1 being the desired value in the diagonal cells. Figure 6 
presents the confusion matrix for the proposed classification model. 


Table 1. Table of parameters and results in testing times 

Scale Interval Samples Accuracy 

50 401 9,000 99.16% 

100 201 9,000 99.51% 

100 201 9,000 99.73% 

100 601 9,000 99.69% 

150 201 9,000 99.20% 

150 401 9,000 99.81% 
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T i 
normal abnormal 
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Figure 6. Confusion matrix for the proposed classification model 


Our model has a confusion matrix with cells in diagonal equal 1, that show the amounts of actual 
abnormal beats and predicted abnormal beats almost similar. The learning curve in Figure 7 shows the 
effectiveness of this classification model. The learning curve shows the graphs of the value of loss function 
and accuracy of training set and validation set during the classification time. In our model, we receive good 
results in both training set and validation set. After the first epoch, the accuracy of the training set and 
validation set are very high and stable, approximately 1. Contrary to the accuracy, the values of the loss 
function of two sets are very low and stable after the first epoch, approximately 0. That proof our model is 
not overfitting or underfitting. 
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Figure 7. Learning curves 


3.2. Discussion 

To recognize the effectiveness of the method which uses wavelet transform to convert ECG signal 
to 2- D spectral images then classification by dense neural network model, we compare it with other methods 
also detecting arrhythmias and using MIT-BIH database. We compared the model of automatic arrhythmia 
classification based on ECG signaling with other recent models as shown in Table 2. Our average accuracy, 
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sensitivity, specificity, precision value, in turn, reached 99.8%, 99.7%, 99.8%, 99.8% respectively, 
demonstrated superior performance when compared to previous algorithms that produced two classes in the 
five initial models. Our model has the highest average accuracy of the algorithms compared. This shows the 
superiority of deep learning algorithms compared to machine learning algorithms [9], [23], [24]. With other 
models using CNN or long short term memory (LSTM) [25], general sparsed neural network (GSNN) [26], 
radial basis function (RBF) [27], our model has better results, as our input to neural network model is 2D 
spectral images which have information in both time and frequency domain. One of the possible factors 
affecting the final result is the lesser number of heartbeats we utilize for training and testing our deep neural 
network model. When we develop our model to be a multiclass classification model, the results of seven 
models with output of more than two classes show that our model may be a good improvement. 

The detection of arrhythmias on hourly long ECG records is time-consuming and requires the 
examiner to pay close attention. It is feasible to improve the performance of medical professionals by guiding 
the observer to analyze noticeable anomalies using automated categorization methods. As a result, the 
diagnosis and treatment of cardiovascular disorders in the clinic may be done faster and more efficiently. 


Table 2. Comparison between the proposed model and other state-of-the-art ECG classification techniques 


Years Model Class Accuracy % Specificity % Sensitivity % Precision% F1 score 
2018 SVM [23] 2 96% - - - - 
2018 KNN [9] 2 97.5% - - - - 
2019 CNN [25] 2 97.2% 98.7% 93.8% 96.8% - 
2019 LSTM [25] 2 71.4% 50.1% 93.6% 64.2% - 
2019 SVM [24] 2 98.3% 97.5% 99.1% - 98.3% 
2021 Proposed model (DNN) 2 99.8% 99.8% 99.7% 99.8% - 
2020 GSNN [26] 5 98% - - 98% 98% 
2016 SVM-RBF [27] S 98.91% 97.85% 98.91% - - 
2019 Faster R-CNN [17] 5 99.21% 99.45% 98.06% - - 
2020 CNN [28] 5 98.33% 99.09% 98.33% 98.34% - 
2020 LSTM [29] 5 99.37% 99.14% 94.89% 96.73% 95.77% 
2019 CNN [30] 5 99% - - - - 
2020 CNN [16] 8 99.11% 99.61% 97.91% 98.58% 98% 
2021 CNN [15] 3 98.7% - - - - 


4. CONCLUSION 

In this paper, we show how to use a dense neural network model to detect arrhythmias from ECG 
data recordings. An accurate taxonomy of ECG signals provides an excellent foundation for cardiovascular 
disease diagnosis and prognosis. Our approach is unique in that it uses the Wavelet transform to turn a one- 
dimensional ECG signal into two-dimensional spectral pictures, which are then used as input to a 
classification model. When compared to methods that integrate feature extraction and current machine 
learning technologies, the neural network model has shown beneficial in enhancing the accuracy of heartbeat 
diagnoses. 
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